DOCUMENT RESUME

ED 269 244 SE 046 560 



AUTHOR        Marshall, Sandra P.
TITLE         Errors in Processing Mathematical Information: A
              Cross-Sectional and Longitudinal Study of Individual
              and Sex Differences. Final Report.
SPONS AGENCY  National Inst. of Education (ED), Washington, DC.
PUB DATE      Apr 86
GRANT         NIE-G-83-0048
NOTE          50p.
PUB TYPE      Reports - Research/Technical (143)

EDRS PRICE    MF01/PC02 Plus Postage.
DESCRIPTORS   Cognitive Processes; Cross Sectional Studies;
              Educational Research; Elementary Education;
              *Elementary School Mathematics; *Error Patterns;
              Grade 3; Grade 6; *Longitudinal Studies; *Mathematics
              Achievement; *Sex Differences
IDENTIFIERS   California; *Mathematics Education Research



ABSTRACT 

This study investigated the rates of successful
mathematical performance and errors of information processing in
third-grade children in California and continued an investigation of
these factors in sixth-grade children. The objectives were to: (1)
identify areas of mathematics in which the children had recognizable
strengths and weaknesses, (2) classify characteristic areas according
to information-processing theory, and (3) relate the errors made by
third graders in 1980 with errors made by the same children as sixth
graders in 1983. Approximately 25,000 children in each grade were
tested for each year using data from the Survey of Basic Skills in
the California Assessment Program. The tests and population are first
briefly discussed, followed by the results of the analyses. For each
grade level, discussion focuses on: correct performance by categories,
namely computation, counting/number property, word problems (using
operations), visual problems, geometry/measurement problems, and
nontraditional story problems; the most difficult items; scores on
matched items (computation and applications); and errors. Then the
longitudinal analysis is discussed. Girls appear stronger than boys in
grade 3 and boys were stronger than girls in grade 6. References and
sample items are included in the appendix. (MNS)



***********************************************************************
*   Reproductions supplied by EDRS are the best that can be made      *
*                   from the original document.                       *
***********************************************************************






U.S. DEPARTMENT OF EDUCATION
Office of Educational Research and Improvement

EDUCATIONAL RESOURCES INFORMATION
CENTER (ERIC)

This document has been reproduced as
received from the person or organization
originating it.

Minor changes have been made to improve
reproduction quality.

Points of view or opinions stated in this
document do not necessarily represent official
OERI position or policy.



Errors in Processing Mathematical Information:
A Cross-Sectional and Longitudinal Study of
Individual and Sex Differences



NIE-G-83-0048 
Final Report 



Sandra P. Marshall
Department of Psychology
San Diego State University
San Diego, CA 92182



April 1986 






TABLE OF CONTENTS 



PROJECT OVERVIEW
    Objectives

TEST INSTRUMENTS AND POPULATION
    The Third-Grade Test
    The Sixth-Grade Test
    Population

RESULTS OF THE THIRD-GRADE ANALYSES
    Correct Performance
        Categories of Items
        The Most Difficult Items
        Matched Items
    Analysis of Errors
        Description of the Different Errors and Their
        Corresponding Distracters
            Language Errors
            Errors of Spatial Understanding
            Errors of Mastery
            Errors of Erroneous Rules
            Errors from Lack of Attention

RESULTS OF THE SIXTH-GRADE ANALYSES
    Correct Performance
        Categories of Items
        The Most Difficult Items
        Matched Items
    Analysis of Errors

RESULTS OF THE LONGITUDINAL STUDY
    Analyses Using the Entire Population
        Correct Performance
        Comparisons of Errors
    Comparisons Based Upon Subset of Population
        Correct Versus Incorrect Performance
        Persistence in Making the Same Error

SUMMARY

REFERENCES

APPENDIX A: Sample Problems from California Assessment
            Program Tests for Third and Sixth Grades






LIST OF TABLES 



Table 1:  Description of Items: Third Grade
Table 2:  Correct Performance on the Survey
          of Basic Skills: Grade 3
Table 3:  Average Correct Performance of Boys
          and Girls on Matched Items: Third Grade
Table 4:  Distracter Analysis: Third Grade
Table 5:  Description of Items: Sixth Grade
Table 6:  Correct Performance on the Survey
          of Basic Skills: Grade 6
Table 7:  Average Correct Performance of Boys
          and Girls on Matched Items: Sixth Grade
Table 8:  Distracter Analysis: Third Grade
Table 9:  Comparisons of Distracter Choices:
          Third and Sixth Grades
Table 10: Specific Content Areas Present on
          Third-Grade and Sixth-Grade Tests
Table 11: Comparison of the Relative Gains by Boys
          and Girls from Third to Sixth Grade
Table 12: Comparisons of Boys' and Girls' Responses
          to Matched Pairs of Items






LIST OF FIGURES 



Figure 1: A Comparison of the Most Difficult
          Items for Boys and Girls: Grade 3
Figure 2: Performance of Boys and Girls on Matched
          Items of Computations and Applications:
          Grade 3
Figure 3: A Comparison of the Most Difficult
          Items for Boys and Girls: Grade 6
Figure 4: Performance of Boys and Girls on Matched
          Items of Computations and Applications:
          Grade 6
Figure 5: Relative Change in Boys' and Girls'
          Performance from Third to Sixth Grade






PROJECT OVERVIEW



The present project was an investigation of rates of
successful performance and errors of information processing
in third-grade children and a continuation of investigation
of these issues in sixth-grade children. Three objectives
were specified for the project. Each is described below,
together with a brief statement describing success in
reaching the objective.

Objectives 

The objectives were: 

(1) To identify areas of mathematics in which third-grade
and sixth-grade girls and boys have recognizable strengths
and weaknesses.

At the third grade, girls performed significantly better 
than boys in the areas of arithmetic computations, 
principles of counting, and nonstandard problem solving. At
the sixth grade, boys had significantly more success than 
girls in solving geometry/measurement problems and 
traditional word problems. Girls maintained their advantage 
for arithmetic computations. 

(2) To classify characteristic errors made by either sex at
the two grades according to information-processing theory.

At both grades, boys were more likely than girls to make 
errors related to usage of erroneous arithmetic rules, 
including errors of number fact, errors in using algorithms, 
and errors of confusion with horizontal problems. They were 
also more likely to select the opposite semantic category 
(e.g., respond with the greatest rather than the least value 
when the least was required). Girls were more likely than 
boys to make errors of association, e.g., focusing on 
particular words in the problems and using inappropriate 
rules such as adding all numbers in the problem. Both boys
and girls made errors related to attention, with boys more 
likely to make careless errors of transcription and girls 
more likely to omit a step of the solution. 

(3) To relate the errors made by a large sample of third-grade
children in 1980 with the errors made by the same
children as sixth graders in 1983.

There is substantial improvement in children's mathematical 
performance from third to sixth grade. However, a large
number of children who failed to solve particular problems 
at the third grade remained unable to solve similar problems 
at the sixth grade. Girls were more likely than boys to be 
incorrect on items at both grades. 

The data studied were responses to standardized
achievement tests taken by all California third and sixth
grade children enrolled in public schools. Approximately
250,000 children at each grade were tested for each year 
studied. The data were gathered by the California Assessment 
Program of the California Department of Education. 

This research provides new information about the nature 
of errors made by elementary school children. Children's 
responses were examined in the context of cognitive skills 
and information processing. A more usual method of research 
has been to study only correct performance within narrowly 
defined subfields of mathematics such as geometry or 
arithmetic. Emphasis in this study was on cognitive 
behaviors — correct and incorrect — that apply over many 
different subfields. An advantage of a large study such as
the one carried out here is that many subfields of third and
sixth grade mathematics could be studied simultaneously. The
evaluation of a large number of children's responses to a
large number of items provides information about
similarities and differences in children's problem solving
at two important ages.



TEST INSTRUMENTS AND POPULATION 



Responses of third-grade and sixth-grade children to 
grade-level standardized tests were examined. The tests are 
the Surveys of Basic Skills, Grades 3 and 6, administered 
annually to all third-grade and sixth-grade children 
enrolled in public schools in California. The tests were 
developed by and are administered under the California 
Assessment Program (CAP) of the California Department of 
Education. The tests assess reading, written expression, and
mathematics performance. Additional details may be found in 
the California Assessment Program Annual Report (1983). 

These tests were designed to assess the average
performance of children at school, school district, and
state levels. Individual results are not released to the
schools or to the students. A variety of items are included
in the tests, and the objective is to evaluate a large 
number of separate concepts identified from the curricula of 
third-grade and sixth-grade mathematics. 



The Third-Grade Test

The Survey of Basic Skills, Grade 3, contains 360
mathematics items. There are 30 distinct test forms, and
each contains 12 math items. Each student responds to a
single form. The tests are not equally difficult, and the
items on each form usually test different concepts. Seven
areas of mathematics are evaluated by the Survey:

    arithmetic operations            155 items
    counting and place value          45 items
    number properties                 45 items
    measurement                       40 items
    geometry                          30 items
    patterns and graphs               30 items
    nontraditional word problems      15 items

In all but the last category, at least two types of
problems, "skills" and "applications", test the concepts.
Skill items are simple computations. Applications are word
problems requiring the identification and use of skills for
solution.

An additional feature of the Survey is the inclusion of
matched pairs of skill and application items using the same
numerical values and having the same set of distracters. For
example, the items below are matched:

      78
    + 45
    ----

    ( ) 33        ( ) 123
    ( ) 133       ( ) 1113

    Jenny baked 45 cookies.
    Then she baked 78 more.
    How many cookies did she bake?

    ( ) 33        ( ) 123
    ( ) 133       ( ) 1113


There are 32 pairs of matched items on the test. 

The mathematics section of the third-grade test was
first administered in May 1979, and has been given every
spring thereafter. In this research project, we evaluated
responses from the 1980 administration.
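
As a quick arithmetic cross-check of the third-grade test design described above, the following sketch sums the item counts by content area and compares the total with the number of forms times the number of items per form. The dictionary and variable names are illustrative assumptions, not part of the California Assessment Program materials.

```python
# Item counts by content area for the Survey of Basic Skills, Grade 3,
# as listed in this section. The names used here are illustrative only.
AREA_COUNTS = {
    "arithmetic operations": 155,
    "counting and place value": 45,
    "number properties": 45,
    "measurement": 40,
    "geometry": 30,
    "patterns and graphs": 30,
    "nontraditional word problems": 15,
}

FORMS = 30            # distinct test forms
ITEMS_PER_FORM = 12   # mathematics items on each form

total_items = sum(AREA_COUNTS.values())
items_across_forms = FORMS * ITEMS_PER_FORM

# Both quantities should equal the 360 items stated in the text.
print(total_items, items_across_forms)
```

Because each student answers only one form, every item is answered by roughly one thirtieth of the tested population.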



The Sixth-Grade Test

The Survey of Basic Skills, Grade 6, is similar in
design to the third-grade test. The first test of the
sixth-grade level was administered from 1975 through 1981 and
contained 160 mathematics items in essentially the same
content areas as those described for the third-grade test.
It also contained items of probability and statistics. The
test was revised and expanded in 1981. It currently contains
480 items distributed in the following categories:

    arithmetic operations              14[?] items
    counting and place value            4[?] items
    number properties                   50 items
    measurement                         58 items
    geometry                            40 items
    equations and coordinate graphs     42 items
    tables and charts                   30 items
    probability and statistics          23 items
    nontraditional word problems        52 items

There are 12 pairs of matched items on the sixth-grade
test. For example:

    0.5 + 0.03 =

    ( ) 0.008
    ( ) 0.08
    ( ) 0.53
    ( ) 0.8

    A paper clip weighs 0.5 grams. A piece of paper
    weighs 0.03 gram. How much would the paper and
    the paper clip weigh?

    ( ) 0.008
    ( ) 0.08
    ( ) 0.53
    ( ) 0.8

The revised test was first administered in May 1982 and
has been given annually since that time. Individual student
responses to the 1983 administration were used in this
project.

It should be noted that the children responding as 
third graders in 1980 were sixth graders in 1983. Therefore, 
the responses to the sixth-grade test in May 1983 are doubly 
valuable: they provide information about sixth-grade problem
solving in general and they also contain longitudinal 
information about the development of problem-solving skills 
and use of cognitive processes from the third to the sixth 
grade. 

Population 

Every third-grade and sixth-grade child enrolled in
public school in California responds to the standardized
tests described above. Approximately 250,000 to 300,000
children at each grade are tested annually. The population
varies by sex, by age in months, by the primary language
spoken at home, by geographic location, and by socioeconomic
status. These student characteristics are collected for each
individual together with item responses.

Responses from all students were examined in the 
initial comparisons. The results of these investigations are 
reported in the second and third sections of this report. A 
subpopulation was identified for the longitudinal study, 
discussed in section four. For this subset of data, 
attention was restricted to children enrolled in the same 
school at grades three and six. This enrollment information 
is routinely gathered by the California Assessment Program
when the sixth-grade tests are administered.

The California Assessment Program makes student
identification through the personal characteristics
described above, namely, sex, birthdate, primary language, 
and ethnicity. CAP does not record student names (since 
individual test scores are not released). Therefore, the 
process of matching third-grade and sixth-grade responses 
for individuals was based upon these same personal 
characteristics. For each school, the third- and sixth-grade 
individuals were matched according to sex and birthdate. The 
estimates of primary language and socioeconomic status were 
not used as matching variables. These responses were 
estimates made by the teachers at each grade. It was feared
that teachers from grade to grade might differ in their
estimation of children's socioeconomic background and of the
language spoken most often at home. It is also possible that
one or both of these variables might have changed within the
three-year period from third to sixth grade. The variables
of school, birthdate, and sex were invariant over this
period. 



We were able to locate full test data at both third and
sixth grade for roughly 100,000 students. Our final subset
of data contains responses from children enrolled in
elementary schools that span third through sixth grade (at
least). In the initial population of 300,000 third grade
students, about 150,000 students were in elementary schools
that covered only kindergarten through fourth or fifth
grades. These students then moved to middle schools
containing grades six through eight. We had no means of
matching feeder elementary schools with middle schools and 
thus were unable to follow these children. The remaining 
students not in our matched subset were students who failed 
to give full demographic information at one or both of the 
test administrations or students in the same school having 
identical personal characteristics. This final criterion 
meant that identical twins or fraternal twins of the same 
sex were excluded, since they manifested identical 
demographic data. 
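
The matching procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual program: the record layout (dictionaries with `school`, `sex`, `birthdate`, and a hypothetical `form` field) and the function name `match_students` are invented for the example. It pairs records that share school, sex, and birthdate, and drops any key that is incomplete or that identifies more than one child within a grade, which is how same-sex twins in one school fall out of the matched subset.

```python
from collections import defaultdict


def match_students(grade3, grade6):
    """Pair grade-3 and grade-6 records sharing (school, sex, birthdate).

    Keys that occur more than once within either grade are ambiguous
    (for example, same-sex twins in one school) and are dropped.
    """
    def index(records):
        by_key = defaultdict(list)
        for rec in records:
            key = (rec.get("school"), rec.get("sex"), rec.get("birthdate"))
            # Skip records with incomplete demographic information.
            if None in key:
                continue
            by_key[key].append(rec)
        # Keep only keys that identify exactly one child.
        return {k: v[0] for k, v in by_key.items() if len(v) == 1}

    g3, g6 = index(grade3), index(grade6)
    return [(g3[k], g6[k]) for k in sorted(g3.keys() & g6.keys())]


# Hypothetical records; "form" stands in for the test data attached to each child.
third = [
    {"school": "S1", "sex": "F", "birthdate": "1971-04-02", "form": 7},
    {"school": "S1", "sex": "M", "birthdate": "1971-06-15", "form": 3},
    {"school": "S1", "sex": "M", "birthdate": "1971-06-15", "form": 9},  # twin
    {"school": "S2", "sex": "F", "birthdate": None, "form": 1},          # incomplete
]
sixth = [
    {"school": "S1", "sex": "F", "birthdate": "1971-04-02", "form": 12},
    {"school": "S1", "sex": "M", "birthdate": "1971-06-15", "form": 5},
    {"school": "S2", "sex": "F", "birthdate": "1971-01-20", "form": 2},
]

pairs = match_students(third, sixth)
print([(a["form"], b["form"]) for a, b in pairs])
```

Only the first third-grade record survives in this toy data: the twins share a key and are ambiguous, and the record with a missing birthdate cannot be matched at all.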






RESULTS OF THE THIRD-GRADE ANALYSES 



For the analyses described in this and subsequent
sections of the report, the test items from the Survey of
Basic Skills, Grades 3 and 6, were evaluated according to six
categories that were common to both tests. Consequently,
some of the items (e.g., probability items from the
sixth-grade test) were not analyzed because they occurred only at
a single grade level. The categories used here are given in
Table 1.



Table 1
Description of Items

                                              FREQUENCY OF
                                              OCCURRENCE:
CATEGORY LABELS                               THIRD GRADE

(1) Computations                                  115
(2) Counting and Properties of Numbers             87
(3) Word Problems                                  44
(4) Visual Problems                                57
(5) Geometry and Measurement Problems              39
(6) Nontraditional Story Problems                  18



Some of these categories differ from those used by the
California Assessment Program. The category of computations
refers to problems given in traditional equation or
expression form for which the student must carry out the
indicated operation(s). Counting and number properties items
are those that require the student to demonstrate knowledge
of concepts such as even/odd, series, and place value. Word
problems are traditional story problems in which one or more
arithmetic operations are embedded. The category of visual
problems contains all problems with a visual component, such
as charts, graphs, or diagrams (excluding problems of
identification of geometrical shapes). Geometry and
measurement problems are discussed as a single category
because of the overlap between these two types of problems
in elementary school. The final category contains
nontraditional story problems that require the student to
make a noncomputational response. For example, problems in
this category may request identification of the facts
required to solve the problem, identification of a
restatement of the problem, or recognition of a similar
problem. Examples from each category may be found in
Appendix A.






Correct Performance 



Categories of Items

In general, the third graders performed quite well on
the first administration of the Survey of Basic Skills,
Grade 3. A summary of their overall rate of success by sex
is given in Table 2.



Table 2

Correct Performance on the
SURVEY OF BASIC SKILLS: GRADE 3

                                 Percent Correct

Area                             Boys       Girls

Computation                      72.12      74.20
Counting/Number Property         71.45      72.88
Word                             64.94      64.65
Visual                           75.48      75.93
Geometry/Measurement             65.86      66.45
Nontraditional                   66.63      69.60



For both boys and girls, word problems were the most 
difficult items of the test and visual problems were the 
easiest items . A rank order of the categories is identical 
for the sexes. From easiest to most difficult they are: 
visual, computation, counting and number property, 
nontraditional, geometry and measurement, and word problems. 

Comparisons were made to determine whether the
probabilities of success for each category differed within
each sex. That is, were boys equally likely to succeed on
computation or word problems, or were there statistically
significant differences between the rates of .7229 and
.6494? For boys, the rates of success over all categories
differed significantly from each other with three
exceptions. Counting and computation items showed no
difference, and the two categories of word problems and
nontraditional problems did not differ from
geometry/measurement items. There were significant
differences between word problems and nontraditional items.
For girls, all categories were significantly different from
each other.

There appear to be two patterns for boys' and girls'
success rates over these categories. For boys, there are two
groups of items, one group containing word problems,
nontraditional problems, and geometry/measurement
[items, and a second group containing computations,
counting/number properties, and visual items, which were]
significantly easier for boys than the other three
categories. A different pattern emerges for girls. Like
boys, they found computations, counting/number properties,
and visual items to be easiest. However, the category of
nontraditional items does not group with the other two
categories of word problems and geometry/measurement.
Instead, these latter two categories form a difficult group
similar to that observed for boys. The nontraditional items
comprise a third group of intermediate difficulty.

Although they were identical in the rank order of
category difficulty, boys and girls differed in the degree
of difficulty associated with each category. For each
category, the probability of success by boys was compared
with that of girls. Three of these comparisons were
statistically significant beyond the usual .05 probability
level: girls had higher levels of success on computations,
items of counting and number properties, and nontraditional
problems. They were also marginally better in solving items
of measurement/geometry and visual items. Boys were not
significantly more successful than girls over any category,
although they demonstrated slightly higher rates of success
for the word problems.

The Most Difficult Items 

A second analysis provides information about which
particular items were most difficult for boys and girls. The
50 items having the lowest p-values were identified for each
sex (i.e., the most difficult items). As one might expect,
a large majority of those that caused difficulty for one sex
also caused similar trouble for the other sex. However,
there were seven items that appeared on the most difficult
list for boys that did not have similar difficulty for
girls. Thus, on 14 percent of the most difficult items
(7 of 50), boys and girls did not agree. Six of these seven
items were arithmetic computations; the seventh was a
nontraditional item requiring identification of the question
asked in the problem. Three of the six computational items were
multidigit subtraction items, one was a simple
multiplication problem, and the remaining two were
horizontal multiplication problems involving only single
digits.

The items that were difficult for girls but not for
boys were three word problems, one visual problem, and two
items requiring multiplication of 10 or 100. This last
weakness has been noted before (CAP, 1981). While girls are
consistently better able than boys to answer problems of
simple arithmetic computation, they have difficulty when the
numbers are multiples of 10. We have no explanation for
this finding.



Figure 1: A Comparison of the Most Difficult Items for Boys and Girls: Grade 3.
(Horizontal axis: rank order by boys, 5 to 50. Item codes: C = Computation,
K = Counting/Number Property, W = Word, V = Visual, M = Geometry/Measurement,
N = Nontraditional.)



The results of comparing the most difficult items for
each sex are consistent with the [illegible] with word problems.

There remained 43 common items [illegible] and girls. Each of
these had an assigned rank from the above lists of the 50 most
difficult items for boys and [illegible]. These ranks were
compared using the [illegible] product-moment correlation
coefficient. The degree of similarity for these two sets of
ranks may be seen in the [illegible] there are a substantial
[illegible] of other [illegible] in the common set which have
different ranks.

[illegible] of the 43 common items falling into the 50 most
difficult items for both boys and girls differed in rank
order by more than 10 places. For example, the item that was
[illegible] more difficult were word problems. Two of the six
considered to be difficult by boys [illegible] computational
items.

Figure 1 contains a summary of these [illegible] quadrant.

Matched Items 

There are 25 pairs of matched items on this test.
Each pair contains a computation item and an
application item requiring the same skill. For this set of
items, we get the results of Table 3.






Table 3

Average Correct Performance of Boys and
Girls on Matched Items: Third Grade

            Computations     Applications

Boys           80.76            73.49
Girls          80.91            72.67



For both sexes, the difference in performance on 
computations and applications is statistically significant 
(p < .05). There are no differences between boys and girls 
on either type of item. 

The relationship between computation and application
performance is more clearly observed in Figure 2. Each pair
of items is plotted. Boys' performance is indicated by the
symbol X and girls' performance is given by O. As can be
seen in the figure, there appears to be a linear
relationship for boys and for girls between performance on
the two types of items. Regression equations for each group
were developed. For boys, the equation for predicting
application performance (A) from computation performance (C)
is:

    A = 1.116C - 16.656.

For girls, the corresponding equation is:

    A = 1.096C - 16.000.

Tests of both regression equations were significant,
indicating that the regression of applications on
computations is significantly different from zero (F =
43.00, df = 1,23; F = 28.60, df = 1,23; p < .001 for both).
A comparison of the regression coefficients in the two
equations was nonsignificant. The relationship between
computations and applications is the same for boys and for
girls.
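
As an illustration of how the two regression equations are used, the sketch below evaluates them at the mean computation scores reported in Table 3. The function and variable names are assumptions made for this example; only the coefficients and the Table 3 means come from the report. Since a least-squares line passes through the point of means, the predictions should land near the mean application scores in Table 3 (73.49 for boys, 72.67 for girls), up to rounding of the published coefficients.

```python
def predict_application(computation_pct, slope, intercept):
    """Predicted percent correct on application items (A) from
    percent correct on the matched computation items (C)."""
    return slope * computation_pct + intercept


# Coefficients as reported in the text: A = slope * C + intercept.
BOYS = (1.116, -16.656)
GIRLS = (1.096, -16.000)

# Mean computation performance on the matched items (Table 3).
boys_pred = predict_application(80.76, *BOYS)
girls_pred = predict_application(80.91, *GIRLS)

# Prints roughly 73.47 and 72.68, close to the Table 3 application means.
print(round(boys_pred, 2), round(girls_pred, 2))
```

Both slopes are slightly above 1 and both intercepts are about -16, so over the range of scores observed for these items the predicted application score sits several points below the computation score.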

These statistical tests suggest that for all children,
there is a reasonably constant relationship between
performance on computation items and performance on
corresponding application items. The difference between the
two is large. Performance on both is measured on the same
scale, percentage correct. As can be seen in the comparison
of means in Table 3, performance on applications lags behind
performance on computations by approximately eight
percentage points. One concludes that students know how to
compute successfully many different types of problems
(computations), but do not know when these computations are
appropriate (applications).



Figure 2: Performance of Boys and Girls on Matched Items of
Computations and Applications: Grade 3.
(Horizontal axis: Computations, percent correct, 30% to 100%.
X = Male, O = Female.)



Analyses of Errors 



In previous research, we have found that boys and girls
have tendencies to make different types of errors on
mathematics problems. In an earlier project funded by the
National Institute of Education (Grant No. NIE-G-80-0095), I
developed a classification of errors based upon the
cognitive processes used (Marshall, 1982; 1983). The
classification had the following categories:

    I.    Language
    II.   Spatial Understanding
    III.  Mastery
    IV.   Association
    V.    Irrelevant Rules
    VI.   Erroneous Rules

This classification was used here as well, but the category
of "Irrelevant Rules" was replaced by a category of
"Lack of Attention."

Under each of the six general types of errors listed
above fall many distinct hypotheses about children's
performance. In addition to the hypotheses formulated and
tested in the earlier project, many new hypotheses have been
proposed and evaluated here. In the original classification,
many hypotheses about errors could not be tested because of
the limited number of test items (160) and the inappropriate
set of distracters for many items. In the current research,
there are 360 items at the third grade and 480 items at the
sixth grade. Most of these items have reasonable and usable
distracters.

The types of distracters evaluated in separate
hypotheses are given in Table 4 together with their parent
category. The list of errors in Table 4 is a result of
theoretical considerations and empirical assessability. We
began with an assessment of the categories and types of
errors studied in the previous NIE project (Marshall, 1982).
We then examined all items on the third-grade and
sixth-grade tests. Our examination yielded several additional
errors that could be evaluated, particularly in the category
of erroneous rules. There are undoubtedly other erroneous
rules that students use in solving mathematics problems. The
ones evaluated here are those with distracters corresponding
to the errors.



Table 4

Distracter Analysis

                                    NO. OF ITEMS   NO. OF TIMES   NO. OF TIMES
                                    HAVING THIS    BOYS MAKE      GIRLS MAKE
TYPES OF DISTRACTERS                DISTRACTER     ERROR MORE     ERROR MORE
                                                   THAN GIRLS     THAN BOYS

I.   LANGUAGE ERRORS                     22             12             10
     A. Literal Translations              6              1              5      *
     B. Opposites                        16             11              5      *

II.  SPATIAL UNDERSTANDING               14             10              4      **
     A. Spatial Reversals                14             10              4

III. MASTERY                            118             59             59
     A. Wrong Operation                 118             59             59

IV.  ASSOCIATION                         25              3             22      ***
     A. Key Words                        11              1             10      ***
     B. Number Patterns                  14              2             12      ***

V.   ERRONEOUS RULES                    132            106             26      ***
     A. Subtract Small from Large        16             13              3      ***
     B. [label illegible]                10              8              2
     C. Add All Digits                   18             14              4      ***
     D. Expand Columns                   14             13              1      ***
     E. Mix Two Operations               14              5              9
     F. Borrowing Errors                 15             12              3      ***
     G. Carrying Errors                  17             17              0      **
     H. Concatenations                   28             24              4      ***

VI.  LACK OF ATTENTION                   76             34             42
     A. Omit a Step                       7              4              3
     B. Careless Transcription           21             14              7
     C. Lack of Perseverance             15              3             12      ***
     D. Interference: Series             16              6             10
     E. Partial Reading                  17              7             10

  *  .10 < p < .20
 **  .05 < p < .10
***  p < .05
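
The surviving text does not state which statistical test produced the significance markers in Table 4. As a sketch only, and assuming (this is an assumption, not the report's stated method) that each row was evaluated with a one-sided sign test on the item counts, the tail probability can be computed directly from the binomial distribution with p = 0.5. The function name is illustrative.

```python
from math import comb


def one_sided_sign_test(larger_count, n_items):
    """P(X >= larger_count) for X ~ Binomial(n_items, 0.5).

    larger_count is the number of items on which one sex made the
    error more often; n_items is the number of items compared.
    """
    tail = sum(comb(n_items, k) for k in range(larger_count, n_items + 1))
    return tail / 2 ** n_items


# Opposites: boys made the error more often on 11 of 16 items.
p_opposites = one_sided_sign_test(11, 16)

# Subtract Small from Large: boys more often on 13 of 16 items.
p_subtract = one_sided_sign_test(13, 16)

print(round(p_opposites, 3), round(p_subtract, 3))
```

Under this assumption the Opposites row gives a probability of about .105 and the Subtract Small from Large row about .011, which would fall in the ".10 < p < .20" and "p < .05" bands respectively. That agreement is suggestive but does not establish that this was the test actually used.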



Description of the Different Errors and Their Corresponding
Distracters

Language Errors. There are two errors under the
category of "Language": (a) literal translations and (b)
opposites. Literal translation refers to the errors made in
translating words directly into numbers (and vice versa)
without regard to place value information. For example, an
error of this type would be the response of 30046 to the
question: "How would you write three hundred forty-six?"

Errors of opposites refer to confusions in semantic
understanding. An example is the response of the least value
when the largest is requested.

Errors of Spatial Understanding. Only a single error
was evaluated at the third grade, the error of spatial
reversals. Spatial reversals are responses that confuse
spatial orientation of top and bottom, left and right.

Errors of Mastery. There is a single type of error in
this category, application of an incorrect arithmetic
operation.

Errors of Erroneous Rules. At the third grade, this is
the largest category of errors. Apparently many children
have not yet mastered the correct procedures for carrying
out arithmetic operations. The first error of Table 4 is
that of subtracting the smallest value from the largest
value without regard for where the values are placed in the
problem. Thus, 24 - 18 = 14 by this erroneous rule (2 - 1 = 1
in the tens column and 8 - 4 = 4 in the ones column).

The second error of this class is the reversal of left
to right in placing numbers. For example, when asked to
write the number three hundred forty-two, a student might
respond with 243.

The third error is that of adding all digits in the
problem. Given the addition problem of 25 + 16, a student
following this rule sums the digits 2, 5, 1, and 6 for a
response of 14.

The error labeled "expanding columns" refers to
addition of two or more columns as if each column were
independent. Under this rule, one gets the following
response to the addition problem of:

      76
    + 89
    ----
    1515

The error of mixing two operations occurs when a
student begins to apply one operation such as addition and
then switches to a second procedure such as multiplication
within the same problem.
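
The erroneous rules described above are mechanical enough to be simulated. The following is a minimal sketch, not part of the original study, that reproduces the three worked examples in the text (24 - 18 = 14, 25 + 16 = 14, and 76 + 89 = 1515). The function names are hypothetical and the rules are implemented only as they are described here.

```python
def _columns(a: int, b: int):
    """Pair up the digits of a and b column by column, padding with zeros."""
    width = max(len(str(a)), len(str(b)))
    return zip(str(a).zfill(width), str(b).zfill(width))


def small_from_large(a: int, b: int) -> int:
    """Erroneous subtraction: in each column, take the smaller digit from the larger."""
    digits = [str(abs(int(top) - int(bottom))) for top, bottom in _columns(a, b)]
    return int("".join(digits))


def add_all_digits(a: int, b: int) -> int:
    """Erroneous addition: sum every digit that appears in the problem."""
    return sum(int(d) for d in f"{a}{b}")


def expanding_columns(a: int, b: int) -> int:
    """Erroneous addition: add each column independently and write the sums side by side."""
    sums = [str(int(top) + int(bottom)) for top, bottom in _columns(a, b)]
    return int("".join(sums))


print(small_from_large(24, 18))   # the response described for 24 - 18
print(add_all_digits(25, 16))     # the response described for 25 + 16
print(expanding_columns(76, 89))  # the response described for 76 + 89
```

A distracter on a multiple-choice item can then be tied to a rule by checking whether the rule, applied to the numbers in the item, yields that distracter.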



[Most of this page is illegible in the source scan. The
recoverable fragments indicate that it describes the two
errors of borrowing and carrying, the error of
concatenation, and the errors grouped under lack of
attention (including lack of perseverance, interference from
a known pattern, and partial reading, the last illustrated
by a word problem about puppies). It then explains the
counts reported in Table 4: the number of times each error
could be made and the number of times that girls or boys
were more likely to make it.]

The hypothesis was that boys and girls were equally likely
to err. Of the distinct tests recorded in Table 4, nine
yielded probabilities smaller than the usual .05 level of
significance. These are indicated by *** in the table. Thus,
boys and girls were NOT equally likely to make the same
errors on roughly half of the errors identified. The results
of five additional tests were marginally significant
(between .05 and .10).
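
The legible text does not name the test behind these probabilities. One natural reading, offered here only as an assumption, is a sign test: if boys and girls were equally likely to err, each item carrying a given distracter would be equally likely to show more errors by boys or more errors by girls. The sketch below computes the exact one-sided binomial tail for a pair of counts of the kind reported in Table 4. The function name is illustrative, and the counts of 3 and 12 are the Lack of Perseverance entries as read from the scan.

```python
from math import comb


def sign_test_p(boys_more: int, girls_more: int) -> float:
    """One-sided exact probability of a split at least this uneven
    when each item is equally likely to favor either sex."""
    n = boys_more + girls_more
    k = min(boys_more, girls_more)
    return sum(comb(n, i) for i in range(k + 1)) / 2 ** n


# Lack of Perseverance, third grade: 3 items favor boys, 12 favor girls.
p = sign_test_p(3, 12)
print(round(p, 4))  # 0.0176
```

Under this assumed test, a 3-to-12 split falls below .05, which agrees with the *** printed beside that row.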

Table 4 also gives the aggregation of errors within
parent categories. Comparisons of these are revealing. In
particular, girls were clearly more likely than boys to make
errors of association, while boys were more likely than
girls to make errors using erroneous rules. Further, if one
excludes the error of careless transcription from the
category of attention, girls were also significantly more
likely than boys to make errors related to attention.






RESULTS OF THE SIXTH-GRADE ANALYSES

Most of the items on the sixth-grade test fall into the
same categories as those of the third grade, with the
addition of probability and algebra. The distribution of
items over categories is given in Table 5.



Table 5
Description of Items

                                             FREQUENCY OF
                                             OCCURRENCE:
CATEGORY                                     SIXTH GRADE

(1) Computations                                  86
(2) Counting and Properties of Numbers           104
(3) Word Problems                            [illegible]
(4) Visual Problems                          [illegible]
(5) Geometry and Measurement Problems        [illegible]
(6) Nontraditional Problems                  [illegible]
(7) Probability                              [illegible]
(8) Algebra                                  [illegible]



The last two categories will not be discussed further since
they have no counterpart at the third grade;
these categories were excluded from all analyses.

Correct Performance 

The level of difficulty of the sixth-grade test differs
from that of the third-grade test. Percent correct at
the third grade ranged from 64.94% to 75.93%. The
sixth-grade range is lower, from 57.48% to 68.17%. Table 6
contains a summary of the correct performance by boys and
girls.

As in the third grade, the most difficult items here are
word problems [...]. However, girls and boys differ in the
order of difficulty. The order of difficulty for boys from
easiest to hardest is: computation, counting/number
properties and visual, geometry/measurement, nontraditional,
and word problems. For girls, the order is: computation,
counting/number properties, visual, nontraditional problems,
geometry/measurement, and word problems. [...] girls
maintained the order found at the third grade. Although the
rank order for boys [...] between geometry/measurement and
nontraditional items, inspection of Table 6 shows that these
percentages are very close. There is also little difference
for girls between the categories of visual and
counting/number properties. However, there is substantial
difference between nontraditional and geometry/measurement
problems, with the former being less difficult.

Table 6

Correct Performance on the
SURVEY OF BASIC SKILLS: GRADE 6

                                  Percent Correct
Area                              Boys       Girls

Computation                       65.84      68.17
Counting/Number Property          65.66      65.47
Word                              59.19      57.48
Visual                            65.66      65.32
Geometry/Measurement              63.90      61.89
Nontraditional                    63.74      64.47



Comparisons among categories for each sex yield
different patterns of difficulty for boys and for girls. The
categories of visual, counting/number properties, and
computations have essentially the same difficulty for boys,
and the categories of nontraditional and
geometry/measurement items are also
indistinguishable. These last two are significantly more
difficult than the other three. Finally, word problems are
significantly more difficult than the pair of nontraditional
and geometry/measurement items. The significance level used
for all tests of categories was .05.

Girls demonstrated a different pattern of item
difficulty. Computation items were significantly easier than
any other category. The three categories of nontraditional,
counting/number properties, and visual items had essentially
the same level of difficulty. These were significantly
easier than the category of geometry/measurement, and the
latter was itself significantly less difficult than the most
difficult category of word problems. Again, the significance
level was .05.

Comparisons were also made between boys and girls for
each category. Recall that at the third grade, girls scored
significantly better than boys on computations,
counting/number properties, and nontraditional items. At
the sixth grade, girls continued to outperform boys on items
of computation but lost ground in four areas. First, they
lost the advantage demonstrated at third grade in the areas
of counting/number properties and nontraditional items;
there were no differences between boys and girls in these
areas at the sixth grade. Second, boys moved from
approximately equal performance with girls at the third
grade in the areas of word problems and geometry/measurement
to superior performance in these two areas at the sixth
grade. These differences were statistically significant
(p < .05).

The Most Difficult Items

The same analysis described in the third-grade section
was carried out with the sixth grade. The items having the
lowest [...] answered correctly were identified [...]. As
was found at the earlier grade, there is a [...] overlap in
the [...]. At the third grade, there were 43 common items.
At the sixth grade, there are 40 common items. The rank
correlation for these 40 items was .79, essentially the
same value as before.

[...] are five computations, two geometry/measurement items,
[...] visual items, and one word problem. This is consistent
with the findings at the third grade. [...] with
computations than do girls. The corresponding list of 10
items that are more difficult for girls than for boys [...]
counting/properties [...] and visual items, and a single
computation.

An examination of the differences between boys and girls
[...] difference fell between 10 and [...]; for an
additional 9 items, the difference was between 5 and 10. The
remaining 20 items were ranked essentially the same by boys
and girls (i.e., were within 5 ranks).

It is useful to examine those items with rank
difference greater than 5. There are 20 items. Seven of
them (35%) [...] ranks on six of the seven (lower rank
indicates greater difficulty). Four of the items were
counting/properties of numbers. Boys found all of them more
difficult than did girls. Boys' ranks [...] word problems
and 2 nontraditional problems were also lower than the ranks
from girls' performance. There were [...] difference greater
than 5. The computation items showed no pattern.

Figure 3 shows the 40 items that were most difficult
for both boys and girls. As before, those items in the
northwest quadrant were relatively more difficult for girls
than for boys; those in the southeast were relatively more
difficult for boys. Most of the items in the former are
geometry/measurement problems. Many of those in the latter
are counting or computation items. Again, these observations
are consistent with previous findings related to girls' and
boys' performance.

Matched Items 

There are 11 pairs of matched items. As on the
third-grade test, these are matched computations and
applications using the same skills and the same numerical
values.



Table 7 

Average Correct Performance of Boys and 
Girls on Matched Items: Sixth Grade 

              Computations     Applications

Boys             70.51            63.48
Girls            72.65            63.63



At the sixth grade, girls perform significantly better than
boys on computations, but there is no difference in
performance on applications. This means, in effect, that
there is a larger discrepancy between girls' performance on
the two types of items than between boys' performance on the
two. This finding is supported by previous research that
found this discrepancy to be statistically significant
(Marshall, 1984).

The matched-item data are plotted in Figure 4. There are
fewer matched items at sixth grade than at third, but the
linear trend is nonetheless apparent. The regression
equations for predicting application performance (A) from
computation performance (C) for boys and girls are:

A = 0.638C + 18.482

and

A = 0.664C + 15.414

respectively. The difference in intercepts corresponds to
the difference in mean performance discussed above. As in
the third grade results, tests of the regression equations
are significant (F = 21.58, df = 1,9; F = 20.65, df = 1,9; p
< .001), and the two coefficients are not significantly
different from each other.
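
As a consistency check on the two regression equations, a least-squares line passes through the point of means, so each equation evaluated at the mean computation score in Table 7 should return roughly the mean application score. The sketch below performs that check. One digit of the boys' slope is illegible in the scan; the value 0.638 used here is an assumption, chosen because it is the value consistent with the Table 7 means. All variable and function names are illustrative.

```python
# (slope, intercept) for A = slope * C + intercept.
# The boys' slope of 0.638 is an assumed reading of a partly illegible digit.
BOYS_LINE = (0.638, 18.482)
GIRLS_LINE = (0.664, 15.414)


def predict_application(computation_score: float, line) -> float:
    """Predicted application percent correct from computation percent correct."""
    slope, intercept = line
    return slope * computation_score + intercept


# Mean percent correct on matched items, from Table 7.
for label, line, mean_c, mean_a in [
    ("Boys", BOYS_LINE, 70.51, 63.48),
    ("Girls", GIRLS_LINE, 72.65, 63.63),
]:
    predicted = predict_application(mean_c, line)
    print(f"{label}: predicted {predicted:.2f}, reported mean {mean_a:.2f}")
```

Both predictions land within a few hundredths of a percentage point of the reported means, which is the size of discrepancy one would expect from rounding of the coefficients.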



[Figure 3: A Comparison of the Most Difficult Items for Boys
and Girls: Grade 6. One axis is labeled "Rank order by Boys".
Items are plotted by category letter: C Computation,
K Counting/Number Property, W Word, V Visual,
M Geometry/Measurement, N Nontraditional.]

[Figure 4: Performance of Boys and Girls on Matched Items of
Computations and Applications: Grade 6. Both axes run from
30% to 100%; the horizontal axis is Computations.
X = Male, O = Female.]



Analysis of Errors 



As far as possible, we attempted to map the
distracters from the sixth-grade items into the categories
defined for the third-grade items. The results are given in
Table 8. Several categories were used at both grades,
particularly distracters [...]. Only the erroneous rules of
subtracting the smallest from the largest value regardless
of placement and carrying/borrowing errors are evaluated
[...]. The remaining five rules from third grade did not
appear as distracters at the sixth grade. A new erroneous
rule was added at the sixth grade [...] fractions.

At the sixth grade, there were insufficient numbers to
evaluate differences between boys and girls on errors of
literal translations and on errors of interference in
computing series. The first of these was marginally
significant at the third grade, with girls more likely than
boys to make the error, and the second revealed no
difference between boys and girls. [...] pursue these
differences at the sixth grade as well, but the data do not
allow it.

The different errors tested [...] into six categories in
Table 8. The first three categories have only one error each
that was evaluated. The remaining three categories are
characterized by several different errors. When these are
combined within each category, it is possible to evaluate
whether boys or girls are more likely to make errors of
erroneous rules, of association, and of attention. As
mentioned above, boys are more likely to make errors from
erroneous rules and girls are more likely to make errors of
association. While both boys and girls make errors of
attention, the nature of the errors is distinct. Girls tend
to leave out [...] multi-step [...]. [...] boys are more
likely to make [...] attentional errors than are girls
(p < .05).



Table 8

Distracter Analysis Sixth Grade

                                NO. OF ITEMS   NO. OF TIMES      NO. OF TIMES
                                HAVING THIS    BOYS MAKE ERROR   GIRLS MAKE ERROR
TYPES OF DISTRACTERS            DISTRACTER     MORE THAN GIRLS   MORE THAN BOYS

I.   LANGUAGE ERRORS                 17              10                 7
     A. Opposites                    17              10                 7

II.  SPATIAL UNDERSTANDING           10               8                 2
     A. Spatial Reversals            10               8                 2        **

III. MASTERY                         66              32                34
     A. Wrong Operation              66              32                34

IV.  ASSOCIATION                     39              14                25
     A. Key Words                    12               3                 9
     B. Number Patterns              27              11                16

V.   ERRONEOUS RULES                 55              44                11        ***
     A. Small from Large             11              10                 1        ***
     B. Carrying Errors               8               8                 0        ***
     C. Borrowing Errors             19              14                 5        *
     D. Numerical Reversals           9               5                 4
     E. Fraction Addition             8               7                 1

VI.  LACK OF ATTENTION              106              60                46
     A. Omit a Step                  35              10                25
     B. Careless Transcription       11              10                 1        ***
     C. Perseverance                 29              22                 7        ***
     D. Partial Reading              19              11                 8
     E. Interference: Formulas       12               7                 5

  *  .10 < p < .20
 **  .05 < p < .10
***  p < .05

Thirteen distracters are evaluated in Table 8. Of
these, seven are significantly different for boys and girls
beyond the .05 level of probability, and two others are
marginally significant. Boys were more likely than girls to
make errors caused by erroneous rules; all hypotheses tested
in this category were highly significant. Girls were more
likely than boys to make errors of attention and/or
association, including focusing on key words, using number
patterns within the problem, and omitting a needed step in
the calculations. Boys also showed a propensity for errors
reflecting lack of attention. In particular, they were more
likely than girls to err by careless transcription of
numbers and perseverance in carrying out the same procedure
multiple times within a problem.
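
The category rows of Table 8 are aggregates of the distracter rows beneath them, which gives a simple way to check the tabled counts. The sketch below sums the three multi-error categories. The counts are as read from the scan of Table 8, where a few entries are partly illegible and are inferred here from the row and column sums; the dictionary layout and function name are illustrative.

```python
# (items with distracter, times boys erred more, times girls erred more)
TABLE_8 = {
    "ASSOCIATION": {
        "Key Words": (12, 3, 9),
        "Number Patterns": (27, 11, 16),
    },
    "ERRONEOUS RULES": {
        "Small from Large": (11, 10, 1),
        "Carrying Errors": (8, 8, 0),
        "Borrowing Errors": (19, 14, 5),
        "Numerical Reversals": (9, 5, 4),
        "Fraction Addition": (8, 7, 1),
    },
    "LACK OF ATTENTION": {
        "Omit a Step": (35, 10, 25),
        "Careless Transcription": (11, 10, 1),
        "Perseverance": (29, 22, 7),
        "Partial Reading": (19, 11, 8),
        "Interference: Formulas": (12, 7, 5),
    },
}


def category_totals(rows: dict) -> tuple:
    """Sum each of the three count columns over the distracters in a category."""
    return tuple(sum(column) for column in zip(*rows.values()))


for name, rows in TABLE_8.items():
    items, boys_more, girls_more = category_totals(rows)
    print(f"{name}: {items} items, boys more on {boys_more}, girls more on {girls_more}")
```

The totals make the pattern in the text visible at a glance: boys lead on most erroneous-rule items, girls on most association items, and the attention category is split because its component errors point in opposite directions.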






RESULTS OF THE LONGITUDINAL STUDY 

As described earlier, the sixth-grade students studied
here are the same students who responded to the California
Assessment Test at the third grade. Their performance at
the two grades is assessed by two sets of analyses:
comparisons of performance at third and sixth grade on the
general categories described in Table 1 and comparisons of
performance on specific items of the two tests. The first
set of analyses is based on the entire population of
students responding at third grade and at sixth grade. The
second set is based upon a subset of students whose
responses at third grade could be matched uniquely with
their responses at sixth grade.

Analyses Using the Entire Population 

The purpose of this section is to tie together the
findings of the two previous sections. In those sections,
the two grades were treated separately. The objective here
is to describe the continuities and discontinuities in
student performance over the three-year span. As before, the
focus is on correct performance and on types of errors
committed.

Correct Performance

Tables 2 and 6 provide information about boys' and
girls' success in solving six types of
problems: computations, visual items, counting/property of
numbers items, word problems, geometry/measurement items,
and nontraditional problems. These were discussed in the
previous two sections of this report.

The range of percent correct at the third grade is
10.54 for boys and 11.28 for girls. At the sixth grade, the
range for boys is 6.65 and for girls is 10.69. Clearly,
there is greater similarity between boys and girls at the
third than at the sixth grade.
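
The sixth-grade ranges quoted above can be recomputed from Table 6 as the difference between the easiest and the hardest category for each sex. The following sketch does so; the percent-correct values are those printed in Table 6, and the names are illustrative. The third-grade ranges depend on Table 2, which is not reproduced in this part of the report, so they are not recomputed here.

```python
# Percent correct by category at grade 6: (boys, girls), from Table 6.
GRADE_6 = {
    "Computation": (65.84, 68.17),
    "Counting/Number Property": (65.66, 65.47),
    "Word": (59.19, 57.48),
    "Visual": (65.66, 65.32),
    "Geometry/Measurement": (63.90, 61.89),
    "Nontraditional": (63.74, 64.47),
}


def score_range(values) -> float:
    """Spread between the easiest and the hardest category."""
    return round(max(values) - min(values), 2)


boys, girls = zip(*GRADE_6.values())
print("Boys:", score_range(boys))
print("Girls:", score_range(girls))
```

For boys the spread runs from word problems to computation and comes to 6.65; for girls the same two categories give 10.69.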

Rank difficulty of items for boys and for girls did not
change significantly from third to sixth grade. At both
grades, word problems were the most difficult items for boys
and for girls. Computation, visual, and counting items were
consistently the easiest. There was no sex-related
difference in rank order and no change over time.

Comparisons were made at both grade levels of the
performance of girls and the performance of boys on each
item category. Recall that at the third grade, girls
performed significantly better than boys on three of the six
categories: computations, nontraditional items, and counting
items. Boys demonstrated no significant superiority on any
category. At the sixth grade, girls continued to perform
better than boys on items of computations but lost ground on
four of the six categories. First, they lost the advantage
demonstrated at third grade in the areas of counting and
nontraditional items. There were no gender differences in
performance in these areas at sixth grade. Second, boys
moved from approximately equal performance with girls at
third grade to superior performance in two areas at sixth
grade. These differences were statistically significant
(p < .05).

[...] graphically in Figure 5. Plotted in [...] category at
each [...] evidence a shift in performance to boys [...] the
sixth grade.

Comparisons of Errors

Tables 4 and 8 contain details of the comparisons of boys
and girls [...] of particular types of errors at the third
and sixth grades. [The remainder of this passage is largely
illegible in the source scan. The recoverable fragments
indicate that it asks whether differences present at the
third grade are also present at the sixth grade, and whether
differentiation on the basis of gender increases or lessens
for errors of each type.]



[Figure 5: Relative Change in Boys' and Girls' Performance
from Third to Sixth Grade. For each item category at each
grade, the average p-value for girls is subtracted from the
average p-value for boys. Item categories on the horizontal
axis: C Computation, K Counting/Number Properties, W Word,
V Visual, M Geometry/Measurement, N Nontraditional.]
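
The quantity plotted in Figure 5 can be recomputed for the sixth grade from Table 6. The sketch below assumes that "p-value" in the caption means item difficulty, that is, the proportion (here the percent) of students answering correctly, which is the usual psychometric sense of the term; that reading is an assumption, as are the names used. Third-grade differences would require Table 2, which is not reproduced in this part of the report.

```python
# Percent correct by category at grade 6: (boys, girls), from Table 6.
GRADE_6 = {
    "C Computation": (65.84, 68.17),
    "K Counting/Number Properties": (65.66, 65.47),
    "W Word": (59.19, 57.48),
    "V Visual": (65.66, 65.32),
    "M Geometry/Measurement": (63.90, 61.89),
    "N Nontraditional": (63.74, 64.47),
}

# Boys minus girls, as in the Figure 5 caption; positive values favor boys.
differences = {
    category: round(boys - girls, 2)
    for category, (boys, girls) in GRADE_6.items()
}

for category, diff in differences.items():
    print(f"{category}: {diff:+.2f}")
```

Positive differences appear for word, geometry/measurement, visual, and counting items, and negative differences for computation and nontraditional items.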






Table 9

Comparisons of Distracter Choices: Third and Sixth Grades

                              Distracter
Result                        Category               Comments

Significant                                          Who:
to same degree:
                              Key Words              Girls
                              Subtract Small         Boys
                              Borrowing              Boys
                              [illegible]            Boys
                              [illegible]            Boys
                              Perseverance           Boys
                              [illegible]            Boys

Increased                                            Who Increased
differentiation:                                     No. of Errors:
                              Omit a Step            Girls

Decreased                                            Who Decreased
differentiation:                                     No. of Errors:
                              Number Patterns        Girls
                              Opposites              Boys
                              Numerical Reversals    Boys

No differentiation
at either grade:
                              Wrong Operation
                              Partial Reading
                              Interference





Comparisons Based Upon Subset of Population 

For these analyses, we matched the responses of
students who responded to one of the third-grade tests with
their corresponding responses to one of the sixth-grade
tests. We were able to isolate matching data for
approximately one-third of the total population, resulting
in a subset of about 100,000 students. Since there are 30
distinct forms of the third grade test and 40 forms of the
sixth grade tests, we have between 80 and 90 students
responding to any pair of third-sixth tests and consequently
to any pair of third-sixth items.
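
The figure of 80 to 90 students per pair of tests follows from the numbers just given, on the assumption (not stated in the text) that test forms were distributed roughly evenly across students at each grade. A short sketch of the arithmetic, with illustrative variable names:

```python
matched_students = 100_000   # approximate size of the matched subset
third_grade_forms = 30       # distinct forms of the third-grade test
sixth_grade_forms = 40       # distinct forms of the sixth-grade test

# Every student took one form at each grade, so each student falls into
# exactly one (third-grade form, sixth-grade form) pair.
form_pairs = third_grade_forms * sixth_grade_forms
students_per_pair = matched_students / form_pairs

print(form_pairs)
print(round(students_per_pair, 1))
```

With 1,200 possible pairs of forms, about 83 students are expected in each pair, which is why the item-level analyses below aggregate over all matched items within a content area.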

The analyses described in this section are based upon 
comparisons at the item level. There are two sets of 
analyses. The first of these compares correct-incorrect 
responses at both grades upon items having matched content. 
The second compares selection of the same distracter on 
matched third-sixth grade items. 

Correct Versus Incorrect Performance

The issue for these analyses is to determine whether
girls and boys maintain performance in areas covered on both
tests. That is, is there any gain or loss from third to
sixth grade in areas that are taught at both grade levels?
We focus here on content areas that are narrower than the
broad categories discussed above (questions about odd and
even numbers would be one such area). The comparison of
interest is illustrated by a two-by-two table of correct and
incorrect performance:

                      Sixth-Grade Item
                   Correct      Incorrect

Third-  Correct   |          |           |
Grade
Item    Incorrect |          |           |

Obviously, there are four cells of the table corresponding
to correct performance on an item at both grades, incorrect
performance at both grades, correct at third but incorrect
at sixth grade, and incorrect at third but correct at sixth.
For every matched pair of items (and for every item repeated
on both sixth and third grade tests), a simple contingency
table can be constructed for boys and girls. Within a
particular content area, one can aggregate frequencies from
all matched items. The test of interest is a chi-square
test of distribution: testing whether the distribution of
boys over the four cells is similar to the distribution of
girls.
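
The test just described compares two distributions over the same four cells, which amounts to a Pearson chi-square statistic on a table with two rows (boys, girls) and four columns, and therefore 3 degrees of freedom. The sketch below is a plain implementation of that statistic. The counts are hypothetical and are included only to show the shape of the input; they are not data from the study.

```python
def chi_square(table) -> float:
    """Pearson chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    column_totals = [sum(column) for column in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * column_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Hypothetical frequencies for one content area. Cell order:
# correct-correct, correct-incorrect, incorrect-correct, incorrect-incorrect.
boys = [40, 15, 20, 10]
girls = [45, 8, 22, 12]

statistic = chi_square([boys, girls])
degrees_of_freedom = (2 - 1) * (4 - 1)
print(round(statistic, 2), degrees_of_freedom)
```

A statistic near zero means boys and girls are spread over the four cells in the same proportions; larger values indicate that the sexes differ in how they moved between correct and incorrect responses.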

The narrow content areas investigated here are given in 
Table 10, together with a brief description of each. 

Table 10

Specific Content Areas Present on
Third-Grade and Sixth-Grade Tests

Content Area                      Brief Description

 1. Place Value I                 Identify the digit in a
                                  specified place value.
 2. Place Value II                Recognize the place value
                                  of a specified digit.
 3. Place Value III               Write number in expanded
                                  notation using place value.
 4. Odd/Even                      Identify odd and even
                                  integers.
 5. Identify Question             Recognize restatement of
                                  the question asked in word
                                  problem.
 6. Fraction Shaded               Identify the fraction
                                  shaded in a figure.
 7. Identify Function Rule        Recognize relation between
                                  X and Y, given multiple
                                  numerical examples of each.
 8. Find Missing Number           Find value of X, given Y and
                                  multiple instances showing
                                  relation between X and Y.
 9. Geometry: Parallel Lines      Identify parallel lines.
10. Geometry: Line Segments       Identify line segments.
11. Geometry: Perimeter           Find the perimeter of a
                                  figure.
12. Geometry: Graphs              Identify coordinates in a
                                  graph.
13. Fill in the Box               From a worked-out computation,
                                  find missing value that is
                                  represented by a box.
14. Words to Numbers              Translate written numerical
                                  statement to numbers.
15. Equation to Problem           Given numerical equation,
                                  identify appropriate word
                                  problem that matches it.
16. Problem to Equation           Given word problem, identify
                                  corresponding mathematical
                                  expression or equation.
17. Word Problems -- Change       [...] permanent alteration
                                  in some set.
18. Word Problems -- Combine      Two distinct sets are joined
                                  into a conceptual superset.
19. Word Problems -- Compare      Contrast the difference
                                  between two quantities.
20. Word Problems -- Vary         Direct variation of one
                                  quantity with a second one.
21. Word Problems -- Transform    Expressing a quantity in a
                                  different scale/dimension.



Two questions were asked of each content area. First,
did boys and girls differ in their progressions in these
areas? This corresponds to the comparisons of percent
correct of boys and of girls at the third grade with percent
correct of boys and girls at the sixth grade. The second
question is whether the distributions of boys and girls
differ in the two-by-two contingency tables.

The answers to the first question are given in Table
11. The performance of girls relative to that of boys drops
in 9 of 21 areas (43 percent). The performance of boys
relative to that of girls drops in 5 of the areas (24
percent). The difference between boys' and girls'
performance remains stable from third to sixth grade for the
remaining 7 areas (33 percent).

Most of the areas in which girls appear to lose ground
are from the counting/property of numbers category. As
pointed out previously, this is an area in which girls
performed significantly better than boys at the third grade
but not at the sixth. The analyses of Table 11 illustrate
the particular difficulties that were experienced by girls.

There is no clear pattern to the relative loss by boys.
Two of the five areas are geometric, and two are types of
word problems. Boys scored significantly higher on both
general categories than did girls at the sixth grade.

Rather than look at these differences as relative
losses, we can view them as relative gains. Thus, boys made
relative gains in the area of counting/property of numbers
from third to sixth grade. Similarly, girls made relative
gains in two areas each of geometry and word problems.

Knowing whether one sex makes relative gains does not
answer the question of whether girls and boys are responding
in approximately equal ways to the pairs of items. In
particular, it does not provide any information about the
distribution of student responses over the four cells. Table
12 contains the results of chi-square tests of distributions
for the 21 content areas.






Table 11

Comparison of the Relative Gains by Boys and Girls
from Third to Sixth Grade

GIRLS LOSE GROUND:  No difference at third grade; boys
                    surpass girls by at least 5% at sixth.

                     1. Place Value I
                     2. Place Value II
                     3. Place Value III
                    10. Geometry: Line Segment
                    13. Fill in the Box

                    Girls surpass boys by at least 5% at
                    third grade; no difference at sixth.

                     4. Odd/Even
                     5. Identify Question
                     6. Fraction Shaded
                    15. Equation to Problem

BOYS LOSE GROUND:   No difference at third grade; girls
                    surpass boys by at least 5% at sixth.

                     7. Identify Function Rule
                    19. Word Problems -- Compare
                    21. Word Problems -- Transform

                    Boys surpass girls by at least 5% at
                    third grade; no difference at sixth.

                    11. Geometry: Perimeter
                    12. Geometry: Coordinate Graphs

NO CHANGE:          Boys and girls approximately equal
                    at both grades.

                     8. Find Missing Number
                     9. Geometry: Parallel Lines
                    17. Word Problems -- Change
                    18. Word Problems -- Combine
                    20. Word Problems -- Vary

                    Boys surpass girls by more than
                    5% at both grades.

                    14. Words to Numbers
                    16. Problem to Equation
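
The grouping in Table 11 follows a simple decision rule based on a 5-percentage-point threshold. The sketch below is one way to express that rule, taking as input the difference in percent correct (boys minus girls) at each grade. The function name is illustrative, the inputs in the usage lines are hypothetical, and the handling of combinations that Table 11 does not list (for example, a complete reversal of the leading sex) is an assumption.

```python
THRESHOLD = 5.0  # percentage points, the cutoff stated in Table 11


def leader(difference: float):
    """Return which sex leads by at least the threshold, or None if neither does."""
    if difference >= THRESHOLD:
        return "boys"
    if difference <= -THRESHOLD:
        return "girls"
    return None


def classify(diff_third: float, diff_sixth: float) -> str:
    """Classify a content area from boys-minus-girls differences at each grade."""
    third, sixth = leader(diff_third), leader(diff_sixth)
    if third == sixth:
        return "no change"
    # Girls lose ground if boys pull ahead by sixth grade, or if an early
    # advantage for girls has disappeared by sixth grade.
    if sixth == "boys" or (third == "girls" and sixth is None):
        return "girls lose ground"
    return "boys lose ground"


# Hypothetical differences, for illustration only.
print(classify(0.5, 6.0))    # no difference at third, boys ahead at sixth
print(classify(-7.0, 1.0))   # girls ahead at third, no difference at sixth
print(classify(6.0, 7.0))    # boys ahead at both grades
```

Note that an area in which one sex leads at both grades is counted as "no change", which is how areas 14 and 16 are treated in Table 11.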



Table 12

Comparisons of Boys' and Girls'
Responses to Matched Pairs of Items

                                                    Level of
     Content Area                    Chi-Square     Significance

 1.  Place Value I                       3.03
 2.  Place Value II                      4.17
 3.  Place Value III                     4.68
 4.  Odd/Even                           18.36       ***
 5.  Identify Question                  16.05       ***
 6.  Fraction Shaded                    27.24       ***
 7.  Identify Function Rule             12.79       ***
 8.  Find Missing Number                 4.73
 9.  Geometry: Parallel Lines            8.07       **
10.  Geometry: Line Segment              2.97
11.  Geometry: Perimeter                 7.38       *
12.  Geometry: Coordinate Graphs         7.34       *
13.  Fill in the Box                     7.07       *
14.  Words to Numbers                   14.45       ***
15.  Equation to Problem                 5.69
16.  Problem to Equation                15.27       ***
17.  Word Problems -- Change            10.11       **
18.  Word Problems -- Combine            4.57
19.  Word Problems -- Compare            7.85       **
20.  Word Problems -- Vary              77.38       ***
21.  Word Problems -- Transform         14.22       ***

  *  p < .10   Marginally Significant
 **  p < .05   Significant
***  p < .01   Significant

Eleven of the 21 tests yield results that exceed
conventional levels of statistical significance. An
additional 3 are marginally significant. The important
questions in these tests are whether girls and boys are
equally likely to miss both items and whether they are
equally likely to solve correctly the third grade item and
err on the sixth grade one. A large number of students were
unable to solve either the third or sixth grade item in all
categories. Over ten percent of all students solving items
in 13 of the 21 categories missed both items. Girls were
more likely than boys to exhibit this pattern of
response. In particular, on two categories, identifying the
function rule (7) and solving vary word problems (20), over
20 percent of the girls failed to solve the items at either
grade.
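
The counts of significant and marginally significant tests can be checked against the chi-square values in Table 12. The sketch below assumes 3 degrees of freedom, which is what a comparison of two groups over four cells implies; the report does not state the degrees of freedom in the legible text. The cutoffs are the standard upper-tail critical values of the chi-square distribution with 3 degrees of freedom, and the chi-square values are as read from the scan (the entry 77.38 for area 20 is kept as printed).

```python
# Critical values for chi-square with 3 degrees of freedom (assumed df).
CRITICAL_DF3 = [(11.345, "***"), (7.815, "**"), (6.251, "*")]


def significance(chi_square_value: float) -> str:
    """Star code from the Table 12 legend: * p < .10, ** p < .05, *** p < .01."""
    for cutoff, stars in CRITICAL_DF3:
        if chi_square_value >= cutoff:
            return stars
    return ""


# Chi-square values for content areas 1 through 21, as read from Table 12.
TABLE_12 = [
    3.03, 4.17, 4.68, 18.36, 16.05, 27.24, 12.79, 4.73, 8.07, 2.97, 7.38,
    7.34, 7.07, 14.45, 5.69, 15.27, 10.11, 4.57, 7.85, 77.38, 14.22,
]

significant = sum(1 for value in TABLE_12 if significance(value) in ("**", "***"))
marginal = sum(1 for value in TABLE_12 if significance(value) == "*")
print(significant, marginal)
```

Under these assumptions the tally is eleven significant and three marginal results, in agreement with the counts stated in the text.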




Six of the significant findings of Table 12 are in
areas related to solving word problems: identifying a
restatement of the question asked in a word problem (5),
recognizing the equation or expression that corresponds to a
word problem (16), and solving four categories of word
problems. A closer examination of the distributions of boys'
and girls' responses in these categories indicates that the
major differences are in the correct-correct and
correct-incorrect cells. Girls are more likely to be correct
on both pairs of items than are boys for four of the areas.
Boys are more likely than girls to be correct on the
third-grade item and incorrect on the same type of
sixth-grade item. This pattern was observed in five of the
six areas.

Persistence in Making the Same Error

One goal of the research described in this report was
to examine the extent to which children continue to make the
same types of errors over time. This issue was addressed by
taking the five categories of word problems described in
Table 10 and examining student performance on those which
had parallel distracters. For example, there were change
word problems at both grades that required addition for the
correct solution and offered the distracter of subtraction.

Analyses similar to those described above were carried
out. For each set of matching items, the following
two-by-two contingency table could be developed:

                      Sixth-Grade Item
                   Correct     Distracter

Third-  Correct    |         |            |
Grade
Item    Distracter |         |            |

There are two questions of interest. First, are there
significant differences in the distributions of boys' and
girls' responses to these items? Second, are boys or girls
more likely to show persistence in making the same error?
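
One way to make the second question concrete, offered here as an illustration rather than as the measure used in the study, is to ask what share of the students who chose the distracter at the third grade chose the parallel distracter again at the sixth grade. The sketch below computes that share from a two-by-two table. The counts are hypothetical and the function name is illustrative.

```python
def persistence_rate(table: dict) -> float:
    """Share of third-grade distracter choosers who chose it again at sixth grade.

    Keys are (third-grade outcome, sixth-grade outcome) pairs, each either
    'correct' or 'distracter'; values are student counts.
    """
    repeated = table[("distracter", "distracter")]
    recovered = table[("distracter", "correct")]
    return repeated / (repeated + recovered)


# Hypothetical counts for one set of matched items.
boys = {
    ("correct", "correct"): 50,
    ("correct", "distracter"): 12,
    ("distracter", "correct"): 20,
    ("distracter", "distracter"): 8,
}
girls = {
    ("correct", "correct"): 55,
    ("correct", "distracter"): 6,
    ("distracter", "correct"): 18,
    ("distracter", "distracter"): 11,
}

print(round(persistence_rate(boys), 3))
print(round(persistence_rate(girls), 3))
```

A comparison of these two rates addresses persistence directly, whereas the chi-square test of the full table addresses the first question, whether the two distributions differ at all.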

Fourteen sets of items were identified. Each set
allowed the contrast of correct performance with the same
distracter. A large majority of the third-grade items were
simple word problems involving a single arithmetic
operation. These items were matched with a similar set of
sixth-grade items also requiring only one computation. In
all cases, the matching distracters corresponded to
computation using the same incorrect arithmetic operation.

Of the 14 comparisons, 4 were statistically
significant, with a probability level smaller than .05, and
3 were marginally significant, having a probability less
than .10. The remaining 7 tests revealed no sex differences.



Only compare items were nonsignificant. There were 
significant differences in performance on the remaining four 
types of items. 






SUMMARY 

It is clear from the preceding analyses that boys and 
girls differ in performance at both third and sixth grades. 
Girls appear to be stronger than boys in mathematics at the 
third grade. This is evidenced in Table 2 and the analyses 
pertaining to that table. Girls have higher probabilities of
successful performance on 5 of the 6 categories of items. 
Three of these differences are statistically significant. 
This finding has been observed by the California Assessment 
Program in other cohorts of students as well (CAP, 1982). 

These findings contradict other research on gender-related
differences in mathematics performance (e.g., Leder,
1982; Benbow & Stanley, 1982). In very few instances have
girls been reported to have higher achievement than
boys. There are several points to be made in this regard.
First, we can be very sure of these results. The data are
not a sample from a specific population; the entire
population of third-grade students in California was
examined. The test itself is a broad-range instrument
containing 360 items. Thus, the present results cannot be
dismissed as artifacts of sampling either from the
population of students or from a limited range of items.

We gain some information about why girls have higher
performance than boys from the analyses of errors. Most of
the statistically significant findings relate to boys'
tendencies to apply erroneous arithmetic rules to
mathematics problems (see Table 4). We hypothesize that
girls are more likely to develop and use the rules of
mathematical computation. Confirmation of this hypothesis
requires additional research on children's abilities to
identify, formulate and discriminate among different rules
or algorithms. There also seems to be an element of
attention to detail reflected in the analyses, particularly
in the tendency of boys to make careless errors of spatial
or numerical reversals. Again, further research is required
to determine whether this difference is attentional,
developmental, or gender-related.

The sixth-grade results follow a more traditional
pattern: boys seem to do better than girls. Girls have
higher probabilities of success on only two of the six
categories of items. Previous research on an earlier
version of the California Assessment Test indicated that
girls were more likely than boys to solve computational
items correctly and boys were more likely than girls to
solve word problems correctly (Marshall, 1984). These
results were replicated here. However, two additional
findings of the present research complicate a simple
interpretation of the computation/word problem results.
First, boys and girls demonstrate approximately equal
understanding of counting principles and properties of
numbers. Thus, girls may be more likely than boys to carry
out computations correctly, but they are not more likely to
demonstrate understanding of the computations. This again
suggests a dependency upon rules or algorithms for
computations with or without a clear understanding of what
the rules mean.

The second additional finding is that girls are more
likely than boys to solve nontraditional word problems
correctly and less likely to solve traditional ones
correctly. The primary distinction between these two types
of items is that in the former students are not asked to
reach a numerical solution. They are expected to analyze a
problem, interpret intermediate steps, identify operations
that will be required, and so forth. The fact that girls
consistently have higher performance than boys on problems
of this type (see CAP, 1982) suggests that girls do indeed
have the capability of understanding what is happening
within a word problem. Why, then, do they perform more
poorly than boys on word problems? There are several
possible explanations. One is that girls develop different
reading skills for mathematics problems. They may engage in
spot reading or in searching for selected words in the text.
When asked to produce a novel response as in the
nontraditional items, they may change their reading styles.

A second reason for the difference in performance is
the rule argument presented above. Using rules is essential
for solving computations correctly. It may be natural for
girls to expect that rules also govern word problems and to
develop rules that can be applied to such problems.
Certainly at the very simplest level, there are a few rules
that may be invoked, such as the word "altogether" generally
means that addition will be required. The problem with this
strategy is that it cannot generalize to complex problems
requiring several arithmetic computations.
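
The limitation of a keyword rule can be sketched in a few lines. This is an illustration under stated assumptions, not a model proposed in the report: only the "altogether" rule comes from the text, the "left" rule and the function name are hypothetical, and the two problems are the apple and bicycle items from Appendix A.

```python
import re

# Hypothetical keyword rules. Only "altogether" is mentioned in the text;
# "left" is added here purely for illustration.
KEYWORD_RULES = {
    "altogether": "add",
    "left": "subtract",
}

def keyword_guess(problem):
    """Return the operation suggested by the first matching keyword, or None."""
    for word in re.findall(r"[a-z]+", problem.lower()):
        if word in KEYWORD_RULES:
            return KEYWORD_RULES[word]
    return None

single_step = "Kim had 4 apples. She ate 3. How many were left?"
multi_step = (
    "It is 1.3 kilometers from Sharon's house to school. She rides her "
    "bicycle to and from school every day. How far does she ride in 5 days?"
)

single_guess = keyword_guess(single_step)  # a keyword matches
multi_guess = keyword_guess(multi_step)    # no keyword matches

# The bicycle item needs two multiplications: a round trip, then five days.
multi_answer = 1.3 * 2 * 5
```

A rule of this kind returns at most one operation per problem, so even with a larger keyword list it has no way to express the two-step structure of the bicycle item.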

A disturbing result of this study is the comparison of
boys' and girls' responses to third and sixth grade items of
the same type (see Tables 10 and 11). It is here that we see
more clearly what is happening in mathematical skill
development between these two grades. Girls either lose
ground or fail to maintain equal performance with boys on
approximately one-half of the subcategories studied. This is
roughly twice as many categories as those showing declining
performance by boys.

These results suggest that teachers may need to address
specific deficits, particularly in the underlying principles
and concepts of mathematics. It may be that boys and girls
require additional instruction or elaboration in different
areas. In particular, it may be necessary to provide
instruction about how to read "mathematically." In
traditional word problems, many inferences must be made. At
this point, we have no information about gender-related
differences in drawing such inferences. We do know that
word problems are very difficult for both boys and girls
(from the rank orders of Tables 2 and 6), and we also know
that they are significantly more difficult for girls than
for boys. The next step must be to evaluate whether
differences in the ability to read mathematically can
account for the results observed here. This should be a
fruitful area of research and may help us understand better
how students solve mathematics problems.






REFERENCES 



Benbow, C. & Stanley, J. (1982). Consequences in high school
and college of sex differences in mathematical
reasoning ability: A longitudinal perspective. American
Educational Research Journal, 19, 598-622.

California Assessment Program. (1981). Student Achievement
in California Schools, 1980-81 Annual Report.
Sacramento: California State Department of Education.

California Assessment Program. (1982). Student Achievement
in California Schools, 1981-82 Annual Report.
Sacramento: California State Department of Education.

Leder, G. C. (1982). Mathematics achievement and fear of
success. Journal for Research in Mathematics Education,
12, 124-135.

Marshall, S. P. (1984). Sex differences in children's
mathematics achievement: Solving computations and story
problems. Journal of Educational Psychology, 76, 194-204.

Marshall, S. P. (1983). Sex differences in mathematical
errors: An analysis of distracter choices. Journal for
Research in Mathematics Education, 14, 325-336.

Marshall, S. P. (1982). Sex differences in solving story
problems: A study of strategies and cognitive
processes. Final Report, NIE-G-80-0095, The National
Institute of Education.






APPENDIX A 



Sample Items From California Assessment Program Tests
From Third and Sixth Grades

PROBLEM TYPE     GRADE     EXAMPLE

Computation      Third       740
                            -672

                           (*) 68
                           ( ) 78
                           ( ) 132
                           ( ) 1412

                 Sixth     1/5 + 3/4 =

                           ( ) 4/9
                           (*) 19/20
                           ( ) 4/20
                           ( ) 3/20

Counting         Third     345 =

                           ( ) 3 + 4 + 5
                           ( ) 400 + 30 + 5
                           ( ) 400 + 50 + 3
                           (*) 300 + 40 + 5

                 Sixth     To find the difference between 83
                           and 18, you:

                           ( ) add
                           (*) subtract
                           ( ) multiply
                           ( ) divide




Appendix A: continued 



Word Problems    Third     Ron had 7 peanuts. Sue
                           had 2 times as many peanuts
                           as did Ron. How many
                           peanuts did Sue have?

                           ( ) 21
                           ( ) 16
                           (*) 14
                           ( ) 9

                 Sixth     It is 1.3 kilometers from Sharon's
                           house to school. She rides her
                           bicycle to and from school every
                           day. How far does she ride in
                           5 days?

                           ( ) 6.3 kilometers
                           ( ) 6.5 kilometers
                           ( ) 10 kilometers
                           (*) 13 kilometers

Nontraditional   Third     Kim had 4 apples.
Problems                   She ate 3.
                           How many were left?

                           Which question is asked?

                           ( ) Did Kim have 4 apples?
                           (*) How many were left?
                           ( ) How many apples did Kim eat?
                           ( ) Did Kim have 3 apples left?

                 Sixth     The 130 students from Marie Curie
                           School are going on a picnic in
                           Carson Park. Carson Park is 12
                           miles from the school. Each bus
                           holds 50 passengers. How many
                           buses are needed?

                           Which numbers are needed to solve
                           this problem?

                           ( ) 130 and 12
                           (*) 130 and 50
                           ( ) 12 and 50
                           ( ) 130, 12, and 50






Geometry/        Third     This shape is:
Measurement
                           ( ) a circle
                           ( ) a square
                           (*) a triangle
                           ( ) a rectangle

                 Sixth     A hand is used to measure the
                           height of a horse. The hand is 4
                           inches long. How tall is a horse
                           that measures 15 1/2 hands?

                           ( ) 15 1/2 inches
                           (*) about 5 feet
                           ( ) about 62 feet
                           ( ) 15 1/2 feet

Visual           Third     Jenny was saving pennies. She
                           put them in bags of 10's and
                           100's. How many pennies does
                           Jenny have?

                           ( ) 423
                           ( ) 342
                           (*) 324
                           ( ) 304

                 Sixth     Happy and Al lived the same
                           distance from the school but in
                           opposite directions. They
                           found that they lived 500 meters
                           apart. Which drawing shows this?

                           ( ) School [rest of drawing illegible]
                           (*) Happy --250-- School --250-- Al
                           ( ) Happy --500-- School --500-- Al
                           ( ) School --250-- Happy --250-- Al



These items are reproduced here with the permission of 
the California Assessment Program. 



